ABSTRACT

A magnitude limited sample of nearly 9000 early-type galaxies, in the redshift range 0.01 < 
z < 0.3, was selected from the Sloan Digital Sky Survey using morphological and spectral criteria. 
The Fundamental Plane relation in this sample is $R_o \propto \sigma^{1.49\pm0.05}\, I_o^{-0.75\pm0.01}$ in the r* band.
It is approximately the same in the g*, i* and z* bands. Relative to the population at the
median redshift in the sample, galaxies at lower and higher redshifts have evolved only little. 
If the Fundamental Plane is used to quantify this evolution then the apparent magnitude limit 
can masquerade as evolution; once this selection effect has been accounted for, the evolution is 
consistent with that of a passively evolving population which formed the bulk of its stars about
9 Gyrs ago. One of the principal advantages of the SDSS sample over previous samples is that
the galaxies in it lie in environments ranging from isolation in the field to the dense cores of 
clusters. The Fundamental Plane shows that galaxies in dense regions are slightly different from 
galaxies in less dense regions. 

Subject headings: galaxies: elliptical — galaxies: evolution — galaxies: fundamental parameters 
— galaxies: photometry — galaxies: stellar content 

1. Introduction 

This is the third of four papers in which the properties of $\sim 9000$ early-type galaxies, in the redshift
range 0.01 < z < 0.3 are studied. Paper I (Bernardi et al. 2003a) describes how the sample was selected
from the SDSS database. The sample is essentially magnitude limited, and the galaxies in it span a wide
range of environments. Each galaxy in the sample has measured values of luminosity $L$, size $R_o$ and surface
brightness $I_o = (L/2)/R_o^2$ in four bands (g*, r*, i* and z*), a velocity dispersion $\sigma$, a redshift, and an
estimate of the local density. Paper II (Bernardi et al. 2003b) shows that the joint distribution of early-type
galaxy luminosities, radii and velocity dispersions is reasonably well fit by a tri-variate Gaussian. It also 
shows various correlations between pairs of variables, such as the luminosity-velocity dispersion relation, the 
luminosity-size relation, and the relation between radius and surface brightness. Paper IV (Bernardi et al. 
2003c) uses the spectra of these galaxies to provide information on the chemical evolution of the early-type 
population. 

This paper places special emphasis on the Fundamental Plane (FP) relation between size, surface brightness
and velocity dispersion. It shows how the FP depends on waveband, color, redshift and environment.
In Section 2.1 we compare the results of a maximum likelihood analysis which can account for evolution and 
selection effects, as well as correlations between errors (e.g., Saglia et al. 2001, and Paper II of this series) 
and standard regression estimates, which cannot. Section 2.2 checks if the residuals from the plane correlate 
with any other observables. A discussion of the mass-to-light ratio is provided in Section 2.3. Evidence for 
weak evolution is presented in Section 2.4, and weak trends with environment are found in Section 2.5. The 
distribution of the galaxies in our sample in $\kappa$-space (Bender, Burstein & Faber 1992) is shown in Section 2.6.
We summarize our findings in Section 3. 

Except where stated otherwise, we write the Hubble constant as $H_0 = 100\,h$ km s$^{-1}$ Mpc$^{-1}$, and we
perform our analysis in a cosmological world model with $(\Omega_M, \Omega_\Lambda, h) = (0.3, 0.7, 0.7)$, where $\Omega_M$ and $\Omega_\Lambda$
are the present-day scaled densities of matter and cosmological constant. In such a model, the age of
the Universe at the present time is $t = 9.43\,h^{-1}$ Gyr. For comparison, an Einstein-de Sitter model has
$(\Omega_M, \Omega_\Lambda) = (1, 0)$ and $t = 6.52\,h^{-1}$ Gyr. We frequently use the notation $h_{70}$ as a reminder that we have set
$h = 0.7$. Also, we will frequently be interested in the logarithms of physical quantities. Our convention is to
set $R \equiv \log_{10} R_o$ and $V \equiv \log_{10}\sigma$, where $R_o$ and $\sigma$ are effective radii in $h_{70}^{-1}$ kpc and velocity dispersions in
km s$^{-1}$, respectively.
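
The two ages quoted above can be checked with the standard closed-form expression for a flat universe containing only matter and a cosmological constant. The following is a minimal sketch; the function name `age_flat_lcdm` and the conversion constant (1/H_0 = 9.778 Gyr for h = 1) are assumptions introduced here for illustration and do not come from the paper.

```python
import math

# 1/H0 in Gyr for H0 = 100 km/s/Mpc (h = 1); standard unit conversion (assumed here).
HUBBLE_TIME_H1_GYR = 9.778


def age_flat_lcdm(omega_m, h=1.0):
    """Present age in Gyr of a flat universe with matter density omega_m
    and cosmological constant 1 - omega_m."""
    omega_l = 1.0 - omega_m
    if omega_l == 0.0:
        # Einstein-de Sitter limit: t = 2 / (3 H0)
        return (2.0 / 3.0) * HUBBLE_TIME_H1_GYR / h
    x = math.sqrt(omega_l / omega_m)
    return (2.0 / (3.0 * math.sqrt(omega_l))) * math.asinh(x) * HUBBLE_TIME_H1_GYR / h


age_lcdm = age_flat_lcdm(0.3)   # in units of h^-1 Gyr
age_eds = age_flat_lcdm(1.0)    # in units of h^-1 Gyr
print(round(age_lcdm, 2), round(age_eds, 2))
```

Passing `h=0.7` converts the result from units of h^-1 Gyr to Gyr.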

2. The Fundamental Plane

In any given band, each galaxy in our sample is characterized by three numbers: its luminosity, $L$,
its size, $R_o$, and its velocity dispersion, $\sigma$. Correlations between these three observables are expected if
early-type galaxies are in virial equilibrium, because

$$\sigma_{\rm vir}^2 \propto \frac{G M_{\rm vir}}{R_{\rm vir}} \propto \left(\frac{M_{\rm vir}}{L}\right) R_{\rm vir} \left(\frac{L/2}{R_{\rm vir}^2}\right). \qquad (1)$$

If the size parameter $R_{\rm vir}$ which enters the virial theorem is linearly proportional to the observed effective
radius of the light, $R_o$, and if the observed line-of-sight velocity dispersion $\sigma$ is linearly proportional to
$\sigma_{\rm vir}$, then this relates the observed velocity dispersion to the product of the observed surface brightness and
effective radius. Following Djorgovski & Davis (1987), correlations involving all three variables are often
called the Fundamental Plane (FP). In what follows, we will show how the surface brightness, $R_o$, and $\sigma$ are
correlated. Because both $\mu_o \propto -2.5\log_{10}[(L/2)/R_o^2]$ and $\sigma$ are distance independent quantities (this assumes
that cosmological dimming and K-corrections have been computed correctly), it is in these variables that
studies of early-type galaxies are usually presented.



2.1. Finding the best-fitting plane 

The Fundamental Plane is defined by: 

$$\log_{10} R_o = a\,\log_{10}\sigma + b\,\log_{10} I_o + c \qquad (2)$$

where the coefficients a, b, and c are determined by minimizing the residuals from the plane. There are a 
number of ways in which this is usually done. Let 



$$\Delta_1 \equiv \log_{10} R_o - a\,\log_{10}\sigma - b\,\log_{10} I_o - c \quad {\rm and} \quad \Delta_o \equiv \frac{\Delta_1}{(1 + a^2 + b^2)^{1/2}}. \qquad (3)$$

Then summing $\Delta_1^2$ over all $N$ galaxies and finding that set of $a$, $b$ and $c$ for which the sum is minimized
gives what is often called the direct fit, whereas minimizing the sum of $\Delta_o^2$ instead gives the orthogonal fit.
Although the orthogonal fit is, perhaps, the more physically meaningful, the direct fit is of more interest if
the FP is to be used as a distance indicator.

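A little algebra shows that the direct fit coefficients are

$$a = \frac{\sigma_{II}^2\sigma_{RV}^2 - \sigma_{IR}^2\sigma_{IV}^2}{\sigma_{II}^2\sigma_{VV}^2 - \sigma_{IV}^4}, \qquad b = \frac{\sigma_{VV}^2\sigma_{IR}^2 - \sigma_{RV}^2\sigma_{IV}^2}{\sigma_{II}^2\sigma_{VV}^2 - \sigma_{IV}^4},$$

$$c = \overline{\log_{10} R_o} - a\,\overline{\log_{10}\sigma} - b\,\overline{\log_{10} I_o}, \quad {\rm and}$$

$$\langle\Delta_1^2\rangle = \frac{\sigma_{II}^2\sigma_{RR}^2\sigma_{VV}^2 - \sigma_{II}^2\sigma_{RV}^4 - \sigma_{RR}^2\sigma_{IV}^4 - \sigma_{VV}^2\sigma_{IR}^4 + 2\sigma_{IR}^2\sigma_{IV}^2\sigma_{RV}^2}{\sigma_{II}^2\sigma_{VV}^2 - \sigma_{IV}^4}, \qquad (4)$$

where $\overline{\log_{10} X} \equiv \sum_i \log_{10} X_i/N$ and $\sigma_{xy}^2 \equiv \sum_i (\log_{10} X_i - \overline{\log_{10} X})(\log_{10} Y_i - \overline{\log_{10} Y})/N$, and $X$ and $Y$ can be
$I_o$, $R_o$ or $\sigma$. For what follows, it is also convenient to define $r_{xy} = \sigma_{xy}^2/(\sigma_{xx}\sigma_{yy})$. The final expression above
gives the scatter around the relation. If surface brightness and velocity dispersion are uncorrelated (we will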
Table 1: Maximum-likelihood estimates, in the four SDSS bands, of the joint distribution of luminosities, sizes
and velocity dispersions. The mean values of the variables at redshift z are $\mu_* - Qz$, $R_*$, $V_*$, and the elements
of the covariance matrix $T$ defined by the various pairwise correlations between the variables are shown.
These coefficients were obtained from the coefficients of the covariance matrix $C$ shown in Table 1 in Paper II.

Band  N_gals  mu_*   sigma_mu  R_*    sigma_R  V_*    sigma_V  rho_Rmu  rho_Vmu  rho_RV  Q

g*    5825    20.74  0.654     0.520  0.254    2.197  0.113    0.801     0.005   0.536   1.15

r*    8228    19.87  0.610     0.490  0.241    2.200  0.111    0.760     0.000   0.543   0.85

i*    8022    19.40  0.600     0.465  0.241    2.201  0.110    0.753    -0.001   0.542   0.75

z*    7914    18.99  0.604     0.450  0.241    2.200  0.110    0.759    -0.001   0.543   0.60


show below that, indeed, $\sigma_{IV} \approx 0$), then $a$ equals the slope of the relation between velocity dispersion and the
mean size at fixed velocity dispersion, $b$ is the slope of the relation between surface-brightness and the mean
size at fixed surface-brightness, and the rms scatter is $\sigma_{RR}\sqrt{1 - r_{RV}^2 - r_{IR}^2}$. Errors in the observables affect
the measured $\sigma_{xy}$, and thus will bias the determination of the best-fit coefficients and the intrinsic scatter
around the fit. If $\epsilon_{xy}$ is the rms error in the joint measurement of $\log_{10} X$ and $\log_{10} Y$, then subtracting
the appropriate $\epsilon_{xy}^2$ from each $\sigma_{xy}^2$ before using them provides estimates of the error-corrected values of $a$, $b$
and $c$. Expressions for the orthogonal fit coefficients can be derived similarly, although, because they require
solving a cubic equation, they are lengthy, so we have not included them here.
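
The direct-fit expressions of equation (4) can be sketched numerically as follows. This is an illustration only: the function name `direct_fit`, the synthetic catalogue and its input coefficients are assumptions made for the example and are not the SDSS measurements. The result is cross-checked against an ordinary least-squares solve, which is what minimizing the sum of $\Delta_1^2$ amounts to.

```python
import numpy as np


def direct_fit(log_r, log_sigma, log_i):
    """Direct-fit coefficients (a, b, c) and rms scatter in log R_o,
    computed from second moments as in equation (4)."""
    cov = np.cov(np.vstack([log_r, log_sigma, log_i]), bias=True)  # 1/N normalisation
    s_rr, s_vv, s_ii = cov[0, 0], cov[1, 1], cov[2, 2]
    s_rv, s_ir, s_iv = cov[0, 1], cov[0, 2], cov[1, 2]
    denom = s_ii * s_vv - s_iv**2
    a = (s_ii * s_rv - s_ir * s_iv) / denom
    b = (s_vv * s_ir - s_rv * s_iv) / denom
    c = log_r.mean() - a * log_sigma.mean() - b * log_i.mean()
    var = (s_ii * s_rr * s_vv - s_ii * s_rv**2 - s_rr * s_iv**2
           - s_vv * s_ir**2 + 2.0 * s_ir * s_iv * s_rv) / denom
    return a, b, c, np.sqrt(var)


# Synthetic catalogue (hypothetical numbers, chosen only to resemble an FP).
rng = np.random.default_rng(0)
n = 5000
log_sigma = rng.normal(2.2, 0.11, n)
log_i = rng.normal(-8.0, 0.24, n)
log_r = 1.2 * log_sigma - 0.75 * log_i - 8.0 + rng.normal(0.0, 0.09, n)

a, b, c, rms = direct_fit(log_r, log_sigma, log_i)

# Cross-check: linear least squares on the same data.
design = np.column_stack([log_sigma, log_i, np.ones(n)])
coef, *_ = np.linalg.lstsq(design, log_r, rcond=None)
resid = log_r - design @ coef
print(a, b, c, rms)
```

Error-corrected coefficients would follow from the same function after subtracting the measurement-error terms from the entries of `cov`, which this sketch does not attempt.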

Neither minimization procedure above accounts for the fact that the sample is magnitude-limited, and 
has a cut at small velocity dispersions. In addition, because our sample spans a wide range of redshifts, we
must worry about effects which may be due to evolution. The magnitude limit means that we cannot simply 
divide our sample up into small redshift ranges (over which evolution is negligible) , because a small redshift 
range probes only a limited range of luminosities, sizes and velocity dispersions. To account for all these 
effects, we use the maximum-likelihood approach (e.g. Saglia et al. 2001) described in Paper II. This method 
is the natural choice given that the joint distribution of $M \equiv -2.5\log_{10} L$, $R \equiv \log_{10} R_o$ and $V \equiv \log_{10}\sigma$
is quite well described by a multivariate Gaussian. The maximum likelihood estimates of the mean values 
of these variables, and the parameters of the covariance matrix C which describes the correlations between 
these variables are shown in Table 1 of Paper II. What remains is to write down how the covariance matrix 
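changes when we change variables from $(M, R, V)$ to $(\mu_o, R, V)$. Because $(\mu_o - \mu_*) = (M - M_*) + 5(R - R_*)$,
the covariance matrix becomes

$$T = \begin{pmatrix}
\sigma_M^2 + 10\,\sigma_M\sigma_R\rho_{RM} + 25\,\sigma_R^2 & \sigma_R\sigma_M\rho_{RM} + 5\,\sigma_R^2 & \sigma_V\sigma_M\rho_{VM} + 5\,\sigma_R\sigma_V\rho_{RV} \\
\sigma_R\sigma_M\rho_{RM} + 5\,\sigma_R^2 & \sigma_R^2 & \sigma_R\sigma_V\rho_{RV} \\
\sigma_V\sigma_M\rho_{VM} + 5\,\sigma_R\sigma_V\rho_{RV} & \sigma_R\sigma_V\rho_{RV} & \sigma_V^2
\end{pmatrix}
\equiv \begin{pmatrix}
\sigma_\mu^2 & \sigma_R\sigma_\mu\rho_{R\mu} & \sigma_V\sigma_\mu\rho_{V\mu} \\
\sigma_R\sigma_\mu\rho_{R\mu} & \sigma_R^2 & \sigma_R\sigma_V\rho_{RV} \\
\sigma_V\sigma_\mu\rho_{V\mu} & \sigma_R\sigma_V\rho_{RV} & \sigma_V^2
\end{pmatrix}. \qquad (5)$$

The coefficients of $T$ are given in Table 1; they were obtained by inserting the values shown in Table 1 of
Paper II into the first of the equalities above. Note that $\mu_o \equiv -2.5\log_{10} I_o$.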
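
The change of variables is a linear map, so the new covariance matrix is $J C J^T$ with $J$ the Jacobian of $(M, R, V) \to (M + 5R, R, V)$. A minimal sketch follows; the numerical values of $\sigma_M$, $\rho_{RM}$ and $\rho_{VM}$ below are placeholders assumed for illustration, not values quoted from Paper II, and the helper names are hypothetical.

```python
import numpy as np


def covariance(sigmas, rho_12, rho_13, rho_23):
    """Build a 3x3 covariance matrix from rms values and correlation coefficients."""
    rho = np.array([[1.0, rho_12, rho_13],
                    [rho_12, 1.0, rho_23],
                    [rho_13, rho_23, 1.0]])
    return rho * np.outer(sigmas, sigmas)


def to_surface_brightness(cov_mrv):
    """Transform the covariance of (M, R, V) into that of (mu, R, V),
    using mu - mu_* = (M - M_*) + 5 (R - R_*)."""
    jac = np.array([[1.0, 5.0, 0.0],
                    [0.0, 1.0, 0.0],
                    [0.0, 0.0, 1.0]])
    return jac @ cov_mrv @ jac.T


# Placeholder inputs (assumed for illustration only).
sig_m, sig_r, sig_v = 0.84, 0.241, 0.111
rho_rm, rho_vm, rho_rv = -0.88, -0.77, 0.54

C = covariance(np.array([sig_m, sig_r, sig_v]), rho_rm, rho_vm, rho_rv)
T = to_surface_brightness(C)
print(np.sqrt(T[0, 0]))  # rms of mu implied by the placeholder inputs
```

Each entry of `T` can be compared term by term with the first matrix in equation (5).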

This matrix is fundamentally useful because it describes the intrinsic correlations between the sizes, 
surface-brightnesses and velocity dispersions of early-type galaxies — the effects of how the sample was selected 
and observational errors have been accounted for. For example, the coefficients in the top right and bottom 
left of T are very close to zero, indicating that surface brightness and velocity dispersion are uncorrelated. In
addition, the eigenvalues and vectors of T give information about the shape and thickness of the Fundamental 
Plane. For example, in r* , the eigenvalues are 0.639, 0.179, and 0.052; the smallest eigenvalue is considerably 
smaller than the other two indicating that, when viewed in the appropriate projection, the plane is quite 
thin. The associated eigenvector gives the coefficients of the 'orthogonal' fit, and the rms scatter around this 
orthogonal fit is given by the (square root of the) smallest eigenvalue (e.g. Saglia et al. 2001). 
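
As a rough check of the eigen-analysis, the matrix $T$ can be rebuilt from the rounded r* entries of Table 1 and diagonalized. This is a sketch under the assumption that the tabulated rms values and correlation coefficients define $T$ as in equation (5). With these rounded inputs, the square roots of the eigenvalues come out close to the three numbers quoted above for r*; the smallest is sensitive to rounding in the correlation coefficients.

```python
import numpy as np

# r* row of Table 1 (rounded as tabulated).
sig_mu, sig_r, sig_v = 0.610, 0.241, 0.111
rho_rmu, rho_vmu, rho_rv = 0.760, 0.000, 0.543

T = np.array([
    [sig_mu**2,                rho_rmu * sig_r * sig_mu, rho_vmu * sig_v * sig_mu],
    [rho_rmu * sig_r * sig_mu, sig_r**2,                 rho_rv * sig_r * sig_v],
    [rho_vmu * sig_v * sig_mu, rho_rv * sig_r * sig_v,   sig_v**2],
])

# eigh returns eigenvalues in ascending order, eigenvectors as columns.
eigvals, eigvecs = np.linalg.eigh(T)
root_eigs = np.sqrt(eigvals)

# Eigenvector of the smallest eigenvalue: the normal to the plane
# in (mu, R, V) coordinates.
normal = eigvecs[:, 0]
print(root_eigs, normal)
```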

If we wish to use the FP as a distance indicator, then we are more interested in finding those coefficients 
which minimize the scatter in $R_o$. This means that we would like to find that pair $(a, b)$ which minimize
$\langle\Delta_1^2\rangle$, where $\Delta_1$ is given by equation (3). A little algebra shows that the solution is given by inserting
the maximum likelihood estimates of the scatter in surface-brightnesses, sizes and velocity dispersions into 
equation (4). 

The maximum likelihood T can be used to provide estimates of the direct and orthogonal fit coefficients, 
as well as the intrinsic scatter around the mean relations (orthogonal to the plane as well as in the direction 
of $R_o$). These are given in Table 2. Although $b$ is approximately the same both for the 'orthogonal' and
the 'direct' fits, $a$ from the direct fit is always about 25% smaller than from the orthogonal fit. In either
case, note how similar a and b are in all four bands. This similarity, and the fact that the thickness of the 
FP decreases slightly with increasing wavelength, can be used to constrain models of how different stellar 
populations (which may contribute more or less to the different bands) are distributed in early-type galaxies. 
If the direct fit is used as a distance indicator, then the thickness of the FP translates into an uncertainty 
in derived distances of about 20%. 

Table 2 also shows results from the more traditional $\chi^2$-fitting techniques, which were obtained as
follows. (These fits were not weighted by errors, and the intrinsic scatter with respect to the fits was estimated
by subtracting the measurement errors in quadrature from the observed scatter.) Ignoring evolution and
selection effects when minimizing $\langle\Delta_1^2\rangle$ and $\langle\Delta_o^2\rangle$ results in coefficients $a$ which are about 10% larger than
those we obtained from the maximum likelihood method. We have not shown these in the Table for the
following reason. If the population at high redshift is more luminous than that nearby, as expected if the
evolution is passive, then the higher redshift population would have systematically smaller values of $\mu_o$.
Since the higher redshift population makes up most of the large $R_o$ part of our sample, this could make
the Plane appear steeper, i.e., it could cause the best-fit $a$ to be biased to a larger value. If we use the
maximum-likelihood estimate of how the luminosities brighten with redshift, then we can subtract off the
brightening from $\mu_o$ before minimizing $\langle\Delta_1^2\rangle$ and $\langle\Delta_o^2\rangle$. This reduces the best-fit value of $a$ so that it is
closer to that of the maximum likelihood method. The coefficients obtained in this way are labeled '$\chi^2$ -
Evolution' in Table 2; they are statistically different from the maximum likelihood estimates, presumably
because they do not account for selection effects or for the effects of observational errors. If we weight each
galaxy by the inverse of $S(z_i|M_*, Q)$ (the selection function defined in Paper II) when minimizing, then this
should at least partially account for selection effects. The resulting estimates of $a$, $b$ and $c$ are labeled '$\chi^2$ -
Evolution - Selection effects' in Table 2. The small remaining difference between these and the maximum
likelihood estimates is likely due to the fact that the likelihood analysis accounts more consistently for errors.
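
The '$\chi^2$ - Evolution - Selection effects' procedure can be sketched as a weighted least-squares direct fit. This is a minimal illustration, not the authors' code: the function name, the synthetic inputs and the stand-in selection values are all assumptions. Evolution is removed using the convention of Table 1, in which the mean surface brightness at redshift z is $\mu_* - Qz$, and any photometric zero-point is absorbed into $c$.

```python
import numpy as np


def weighted_plane_fit(log_r, log_sigma, mu_obs, z, q_evol, selection):
    """Direct fit of log R_o = a log sigma + b log I_o + c, after removing
    luminosity evolution from mu and weighting each object by 1/selection."""
    mu_corr = mu_obs + q_evol * z      # undo the brightening -Q z
    log_i = -mu_corr / 2.5             # mu = -2.5 log10 I (zero-point goes into c)
    root_w = np.sqrt(1.0 / selection)
    design = np.column_stack([log_sigma, log_i, np.ones_like(log_r)])
    coef, *_ = np.linalg.lstsq(design * root_w[:, None], log_r * root_w, rcond=None)
    return coef                        # (a, b, c)


# Noise-free synthetic check (hypothetical numbers).
rng = np.random.default_rng(3)
n = 2000
z = rng.uniform(0.01, 0.3, n)
log_sigma = rng.normal(2.2, 0.11, n)
mu_rest = rng.normal(19.9, 0.6, n)
mu_obs = mu_rest - 0.85 * z                      # brightening with Q = 0.85
log_r = 1.2 * log_sigma - 0.75 * (-mu_rest / 2.5) - 8.0
selection = rng.uniform(0.1, 1.0, n)             # stand-in for S(z_i | M_*, Q)

coef = weighted_plane_fit(log_r, log_sigma, mu_obs, z, 0.85, selection)
print(coef)
```

In a real application `selection` would be evaluated from the selection function of Paper II, which is not reproduced here.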

Figure 1 shows the FP in the four SDSS bands. We have chosen to present the plane using the coefficients, 
obtained using the maximum-likelihood method, which minimize the scatter orthogonal to the plane. (In 
all cases, the evolution of the luminosities has been subtracted from the surface brightnesses.) The results 
to follow regarding the shape of the FP, and estimates of how the mean properties of early-types depend on 
redshift and environment, are independent of which fits we use. A fair number of the galaxies in our sample 
have velocity dispersion measurements with small S/N (see, e.g., Figure 19 of Paper I). The FP is relatively 
Table 2: Coefficients of the FP in the four SDSS bands. For each set of coefficients, the scatter orthogonal
to the plane and in the direction of $R_o$ are also given.



Band  a          b           c             rms_orth  rms_Ro

Orthogonal fits

Maximum Likelihood

g*    1.45±0.06  -0.74±0.01  -8.779±0.029  0.056     0.100

r*    1.49±0.05  -0.75±0.01  -8.778±0.020  0.052     0.094

i*    1.52±0.05  -0.78±0.01  -8.895±0.021  0.049     0.091

z*    1.51±0.05  -0.77±0.01  -8.707±…      0.049     …

$\chi^2$ - Evolution

…     1.43±0.06  -0.78±0.01  -9.057±0.032  …         …

(only fragments of the remaining rows survive: c = ….719, a = 1.48±0.05, c = ….699, 0.050)

$\chi^2$ - Evolution - Selection effects

g*    1.35±0.06  -0.77±0.01  -8.820±…      0.058     0.100

r*    1.40±0.05  -0.77±0.01  -8.678±0.023  0.053     0.092

i*    1.41±0.05  -0.78±0.01  -8.688±0.024  0.050     0.090

z*    1.41±0.05  -0.78±0.01  -8.566±0.026  0.048     0.089

Direct fits

Maximum Likelihood

g*    1.08±0.05  -0.74±0.01  -8.033±0.024  0.061     0.092

r*    1.17±0.04  -0.75±0.01  -8.022±0.020  0.056     0.088

i*    1.21±0.04  -0.77±0.01  -8.164±0.018  0.053     0.085

z*    1.20±0.04  -0.76±0.01  -7.995±0.021  0.053     …

… Selection effects

g*    1.05±0.05  -0.79±0.01  -8.268±0.026  0.063     0.094

r*    1.12±0.04  -0.76±0.01  ….932±0.020   0.057     0.088

i*    1.14±0.04  -0.76±0.01  ….904±0.019   0.054     …

z*    …          …           ….784±0.021   0.053     …

(Entries marked … are not recoverable from this copy of the table.)

Fig. 1. - The Fundamental Plane in the four SDSS bands. Coefficients shown are those which minimize
the scatter orthogonal to the plane, as determined by the maximum-likelihood method. Surface-brightnesses 
have been corrected for evolution. 
Table 3: Coefficients of the FP in the complete and magnitude-limited simulated catalogs, obtained by
minimizing a $\chi^2$ in which evolution in the surface brightnesses has been removed, and which weights objects
by the inverse of the selection function.

Band a b c rms_orth rms_Ro

Orthogonal fits 

Complete 

g* 1.44±0.05 -0.74 ± 0.01 -8.763 ±0.028 0.056 0.100 

r* 1.48±0.05 -0.75 ± 0.01 -8.722 ±0.020 0.052 0.094 

Magnitude limited 

g* 1.39±0.06 -0.74 ±0.01 -8.643 ±0.028 0.056 0.100 

r* 1.43±0.05 -0.76 ±0.01 -8.721 ± 0.021 0.052 0.093 

Direct fits 

Complete 

g* 1.09±0.04 -0.74 ± 0.01 -7.992 ±0.023 0.061 0.091 

r* 1.16±0.04 -0.75 ± 0.01 -8.005 ±0.020 0.056 0.088 

Magnitude limited 

g* 1.04±0.05 -0.74 ±0.01 -7.817 ±0.025 0.061 0.090 

r* 1.11±0.04 -0.75 ± 0.01 -7.895 ±0.020 0.056 0.087 



insensitive to these objects: removing objects with S/N < 15 had little effect on the best fit values of a, b. 
Removing objects with small axis ratios also had little effect on the maximum likelihood coefficients. 

In principle, the likelihood analysis provides an estimate of the error on each of the derived coefficients. 
However, this estimate assumes that the parametric Gaussian form is indeed a good fit. Although we present 
evidence in Paper II that the Gaussian form is indeed good, we emphasize that, when the data set is larger,
a non-parametric fit should be performed. Therefore, we have estimated errors on the numbers quoted in
Table 2 as follows. The large size of our sample allows us to construct many random subsamples, each of 
which is substantially larger than most of the samples available in the literature. Estimating the elements 
of the covariance matrix presented in Table 1, and then transforming to get the FP coefficients in Table 2, 
in each of these subsamples provides an estimate of how well we have determined a, b and c. (Note that the 
errors we find in this way are comparable to those sometimes quoted in the literature, even though each of 
the subsamples we generated is an order of magnitude larger than any sample available in the literature.) 
Because each subsample contains fewer galaxies than our full sample, this procedure is likely to provide an 
overestimate of the true formal error for our sample. However, the formal error does not account for the 
uncertainties in our K-corrections and velocity dispersion aperture corrections (discussed in more detail in 
Paper I), so an overestimate is probably more realistic. 
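
The subsample-based error estimate described above can be sketched as follows. Two simplifications are assumed here for brevity: the subsamples are disjoint random partitions of the catalogue, and each is fit with a simple least-squares direct fit rather than the maximum-likelihood covariance estimate used in the text. The function names and the synthetic catalogue are hypothetical.

```python
import numpy as np


def fit_plane(log_r, log_sigma, log_i):
    """Least-squares direct fit; returns (a, b, c)."""
    design = np.column_stack([log_sigma, log_i, np.ones_like(log_r)])
    coef, *_ = np.linalg.lstsq(design, log_r, rcond=None)
    return coef


def subsample_scatter(log_r, log_sigma, log_i, n_sub, rng):
    """Fit the plane in n_sub disjoint random subsamples and return the
    mean and standard deviation of the coefficients across subsamples."""
    chunks = np.array_split(rng.permutation(log_r.size), n_sub)
    coefs = np.array([fit_plane(log_r[k], log_sigma[k], log_i[k]) for k in chunks])
    return coefs.mean(axis=0), coefs.std(axis=0, ddof=1)


# Synthetic catalogue (hypothetical numbers).
rng = np.random.default_rng(2)
n = 8000
log_sigma = rng.normal(2.2, 0.11, n)
log_i = rng.normal(-8.0, 0.24, n)
log_r = 1.2 * log_sigma - 0.75 * log_i - 8.0 + rng.normal(0.0, 0.09, n)

mean_coef, err_coef = subsample_scatter(log_r, log_sigma, log_i, 8, rng)
print(mean_coef, err_coef)
```

Because each subsample is smaller than the full catalogue, the spread `err_coef` overestimates the formal error on a fit to all objects, which is the point made in the paragraph above.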

As a check on the relative roles of evolution and selection effects, we simulated complete and magnitude- 
limited samples (with a velocity dispersion cut) following the procedures outlined in Appendix A of Paper II. 
We then estimated the coefficients of the FP in the simulated catalogs using the different methods. The 
results are summarized in Table 3. When applied to the complete simulations, the $\chi^2$-minimization method
yields estimates of a which are biased high; it yields the input Fundamental Plane coefficients only after 
evolution has been subtracted from the surface brightnesses. However, in the magnitude limited simulations, 
once evolution has been subtracted, it provides an estimate of a which is biased low, unless selection effects 
Table 4: Selection of Fundamental Plane coefficients from the literature.

Source                      Band  N_gal  a          b            Delta_Ro  Fit method

Dressler et al. (1987)      B     97     1.33±0.05  -0.83±0.03   20%       inverse

Lucey et al. (1991)         B     26     1.27±0.07  -0.78±0.09   13%       inverse

Guzman et al. (1993)        V     37     1.14±-     -0.79±-      17%       direct

Kelson et al. (2000)        V     30     1.31±0.13  -0.86±0.10   14%       orthogonal

Djorgovski & Davis (1987)   r_G   106    1.39±0.14  -0.90±0.09   20%       2-step inverse

Jørgensen et al. (1996)     r     226    1.24±0.07  -0.82±0.02   19%       orthogonal

Hudson et al. (1997)        R     352    1.38±0.04  -0.82±0.03   20%       inverse

Gibbons et al. (2001)       R     428    1.37±0.04  -0.825±0.01  20%       inverse

Colless et al. (2001)       R     255    1.22±0.09  -0.84±0.03   20%       ML

Scodeggio (1997)            I     294    1.55±0.05  -0.80±0.02   22%       orthogonal

Pahre et al. (1998)         K     251    1.53±0.08  -0.79±0.03   21%       orthogonal


are also accounted for. Note that this is similar to what we found with the data. The maximum-likelihood 
method successfully recovers the same intrinsic covariance matrix and evolution as the one used to generate 
the simulations, both for the complete and the magnitude-limited mock catalogs, and so it recovers the same 
correct coefficients for the FP in both cases. (We have not shown these estimates in the Table.) 

A selection of results from the literature is presented in Table 4. Many of these samples were constructed 
by combining new measurements with previously published photometric and velocity dispersion measurements,
often made by other authors. (Exceptions are Jørgensen et al. 1996, Scodeggio 1997, and Colless et
al. 2001.) With respect to previous samples, the SDSS sample presented here is both extremely large and 
homogeneous. 

Notice the relatively large spread in published values of a, and the fact that a is larger at longer 
wavelengths. In contrast, the Fundamental Plane we obtain in this paper is remarkably similar in all 
wavebands — although our value of b is consistent with those in the literature, the value of a we find in all 
wavebands is close to the largest published values. In addition, the eigenvectors of our covariance matrix 
satisfy the same relations presented by Saglia et al. (2001). Namely, $v_1 = R_o - aV - bI_o$, $v_2 \simeq -R_o/b -
V(1 + b^2)/(ab) + I_o$ and $v_3 \simeq R_o + I_o/b$. And, when used as a distance indicator, the FP we find is as accurate
as most of the samples containing more than $\sim 100$ galaxies in the literature. Unfortunately, at the present
time, we have no galaxies in common with those in any of the samples listed in Table 4, so it is difficult to 
say why our FP coefficients appear to show so little dependence on wavelength, or why a is higher than it 
is in the literature. 

The fact that $a \neq 2$ means that the FP is tilted relative to the simplest virial theorem prediction
$R_o \propto \sigma^2/I_o$. One of the assumptions of this simplest prediction is that the kinetic energy which enters
the virial theorem is proportional to the square of the observed central velocity dispersion. Busarello et al.
(1997) argue that, in fact, the kinetic energy is proportional to $\sigma^{1.6}$ rather than to $\sigma^2$. Since this is close to
the $\sigma^{1.5}$ scaling we see, it would be interesting to see if the kinetic energy scales with $\sigma$ for the galaxies in
our sample similarly to how it does in Busarello et al.'s sample. This requires measurements of the velocity
dispersion profiles of (a subsample of) the galaxies in our sample, and has yet to be done.

Correlations between pairs of observables, such as the Faber-Jackson (1976) relation between luminosity 
and velocity dispersion, and the Kormendy (1977) relation between the size and the surface brightness, can
be thought of as projections of the Fundamental Plane. They are studied in Paper II. The $\kappa$-space projection
of Bender, Burstein & Faber (1992) is presented in Section 2.6 below. 

2.2. Residuals and the shape of the FP 

Once the FP has been obtained, there are at least two definitions of its thickness which are of interest. 
If the FP is to be used as a distance indicator, then the quantity of interest is the scatter around the relation 
in the $R_o$ direction only. On the other hand, if the FP is to be used to constrain models of stellar evolution,
then one is more interested in the scatter orthogonal to the plane. We discuss both of these below. 

The thickness of the FP is some combination of residuals which are intrinsic and those coming from 
measurement errors. We would like to verify that the thickness is not dominated by measurement errors. 
The residuals from the FP in the different bands are highly correlated; a galaxy which scatters above the FP 
in g* also scatters above the FP in, say, z* . Although the errors in the photometry in the different bands are 
not completely independent, this suggests that the scatter around the FP has a real, intrinsic component. It 
is this intrinsic thickness which the maximum likelihood analysis is supposed to have estimated. The intrinsic 
scatter may be somewhat smaller than the maximum likelihood estimates because there is a contribution to 
the scatter which comes from our assumption that all early-type galaxies are identical when we apply the 
K-correction, for which we have not accounted. 

All our estimates of the scatter around the FP show that the FP appears to become thicker at shorter 
wavelengths. Presumably, this is because the light in the redder bands, being less affected by recent star
formation and extinction by dust, is a more faithful tracer of the dynamical state of the galaxy. The
orthogonal scatter in our sample, which spans a wide range of environments, is comparable to the values 
given in the literature obtained from cluster samples (e.g., Pahre et al. 1998); this constrains models of 
how the stellar populations of early-type galaxies depend on environment. If the direct fit to the FP is
used as a distance indicator, then the intrinsic scatter introduces an uncertainty in distance estimates of
$\sim \ln(10) \times 0.09 \sim 20\%$.
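
The conversion from scatter in $\log_{10} R_o$ to a fractional distance error is a one-line calculation. The sketch below applies it to the rms values in the $R_o$ direction that are legible for the maximum-likelihood direct fits in Table 2; the dictionary name is an assumption made for the example.

```python
import math

# rms scatter in log10 R_o for the maximum-likelihood direct fits (Table 2).
rms_direct = {"g*": 0.092, "r*": 0.088, "i*": 0.085}

# A scatter d(log10 R) corresponds to a fractional error ln(10) * d(log10 R).
frac_distance_error = {band: math.log(10.0) * rms for band, rms in rms_direct.items()}

for band, frac in frac_distance_error.items():
    print(band, round(100.0 * frac, 1), "percent")
```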

Our next step is to check that the FP really is a plane, and not, for example, a saddle. To do this, we 
should show the residuals from the orthogonal fit as a function of distance along the long axis of the plane. 
Specifically, if $X \equiv \log_{10}\sigma + (b/a)\log_{10} I_o + (c/a)$, then

$$X_{\rm FP} \equiv X\sqrt{1 + a^2} + (\log_{10} R_o - aX)\,\frac{a}{\sqrt{1 + a^2}} = \frac{X + a\log_{10} R_o}{\sqrt{1 + a^2}} \qquad (6)$$

and we would like to know if the residuals $\Delta_o$ defined earlier correlate with $X_{\rm FP}$. A scatter plot of these
residuals versus $X_{\rm FP}$ is shown in Figure 2 (we have first subtracted off the weak evolution in the surface
brightnesses). The symbols superimposed on the scatter plot show the mean value of the residuals and plus
and minus three times the error in the mean, for a few small bins along $X_{\rm FP}$. The figure shows that the FP
is reasonably flat; it is slightly more warped in the shorter wavelengths.
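
The flatness test can be sketched as follows: compute $X_{\rm FP}$ and $\Delta_o$ for each object, then compare the binned mean residual with three times its standard error. The function names, the equal-count binning and the synthetic catalogue are assumptions made for illustration; the coefficients used are the r* orthogonal maximum-likelihood values from Table 2.

```python
import numpy as np


def long_axis_and_residual(log_r, log_sigma, log_i, a, b, c):
    """Return X, X_FP (equation 6) and the orthogonal residual Delta_o (equation 3)."""
    x = log_sigma + (b / a) * log_i + c / a
    x_fp = (x + a * log_r) / np.sqrt(1.0 + a**2)
    delta_o = (log_r - a * log_sigma - b * log_i - c) / np.sqrt(1.0 + a**2 + b**2)
    return x, x_fp, delta_o


def binned_residuals(x_fp, delta_o, n_bins=5):
    """Mean residual and three times the error in the mean, in equal-count bins."""
    edges = np.quantile(x_fp, np.linspace(0.0, 1.0, n_bins + 1))
    which = np.digitize(x_fp, edges[1:-1])          # bin index 0 .. n_bins-1
    means = np.array([delta_o[which == k].mean() for k in range(n_bins)])
    errs = np.array([delta_o[which == k].std(ddof=1) / np.sqrt((which == k).sum())
                     for k in range(n_bins)])
    return edges, means, 3.0 * errs


# Synthetic, exactly planar catalogue with scatter (hypothetical numbers).
a, b, c = 1.49, -0.75, -8.778
rng = np.random.default_rng(4)
n = 4000
log_sigma = rng.normal(2.2, 0.11, n)
log_i = rng.normal(-7.95, 0.24, n)
log_r = a * log_sigma + b * log_i + c + rng.normal(0.0, 0.09, n)

x, x_fp, delta_o = long_axis_and_residual(log_r, log_sigma, log_i, a, b, c)
edges, means, bars = binned_residuals(x_fp, delta_o)
print(means, bars)
```

A warped surface would show binned means that drift systematically outside the bars along $X_{\rm FP}$.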

Given that the FP is not significantly warped, we would like to know if deviations from the Plane 
correlate with any of the three physical parameters used to define it. When the plane is defined by minimizing 
with respect to $\log_{10} R_o$, there is little if any correlation of the residuals with absolute magnitude, surface
brightness, effective radius, axis-ratio, velocity dispersion, or color so we have chosen to not present them 
here. Instead, Figure 3 shows the result of plotting the residuals orthogonal to the plane when the plane 
Fig. 2. - Residuals orthogonal to the maximum-likelihood FP fit as a function of distance along the fit (the
long axis of the plane) . Error bars show the mean plus and minus three times the error in the mean in each 
bin. Galaxies with low/high velocity dispersions populate the upper- left/lower- right of each panel, but the 
full sample shows little curvature. 
Fig. 3. - Residuals orthogonal to the FP in r* versus absolute magnitude $M$, surface brightness $\mu_o$, effective
radius $\log_{10} R_o$, axis ratio $b/a$, velocity dispersion $\log_{10}\sigma$, and (g* - r*) color. Note the absence of correlation
with all parameters other than velocity dispersion and color.
Fig. 4. - The FP in four subsamples defined by velocity dispersion. Solid curve (same in all four panels)
shows the maximum likelihood relation of the parent r* sample and dashed lines show the best-fit, obtained
by minimizing the residuals orthogonal to the plane, using only the galaxies in each subsample. The slope
of the minimization fit increases with increasing velocity dispersion, whereas maximum-likelihood fits to the
subsamples, which account for the cut on $\sigma$, give the same slope as for the full sample.
is defined by the orthogonal fit. The residuals show no correlation with $M$, $\mu_o$, $\log_{10} R_o$, or axis ratio (we
have subtracted the weak evolution in $M$ and $\mu_o$ when making the scatter plots). The residuals are anti-correlated
with $\log_{10}\sigma$ and slightly less anti-correlated with (g* - r*) color. The correlation with color is
due to the fact that velocity dispersion and color are tightly correlated (this correlation is studied in more 
detail in Paper IV). The correlation with velocity dispersion is not a selection effect, nor is it associated with 
evolution; we see a similar trend with velocity dispersion in both the complete and the magnitude-limited 
simulated catalogs. 

Figure 4 shows why this happens. The four panels show the FP in four subsamples of the full r* sample,
divided according to velocity dispersion. Notice how the different scatter plots in Figure 4 show sharp cut-offs
approximately perpendicular to the x-axis: lines of constant $\sigma$ are approximately perpendicular to the
x-axis. Whereas the direct fit is not affected by a cut-off which is perpendicular to the x-axis, the orthogonal
fit is. Hence, the residuals with respect to the orthogonal fit show a correlation with velocity dispersion,
whereas those from the direct fit do not. (Indeed, by using the coefficients provided in Tables 1 and 2,
and the definition of the residuals $\Delta_1$ and $\Delta_o$, one can show that $\langle\Delta_1|\log_{10}\sigma\rangle$ is proportional to $\log_{10}\sigma$,
with a constant of proportionality which is close to zero when the parameters for the direct fit are inserted.
However, when the parameters for the orthogonal fit are used, then the slope of the $\langle\Delta_o|\log_{10}\sigma\rangle$ versus
$\log_{10}\sigma$ relation is significantly different from zero.)
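
The contrast between the two kinds of residual can be reproduced in a toy setting. The sketch below is only an illustration under stated assumptions: it draws a Gaussian mock plane with made-up parameters (the means, scatters and the coefficients 1.5 and -0.75 are hypothetical inputs, not the SDSS catalog), and it uses ordinary least squares and a principal-axis fit rather than the maximum-likelihood procedure of this paper. The function names are ours.

```python
import numpy as np

def fit_direct(V, I, R):
    """Least-squares fit of R = a*V + b*I + c, minimising residuals in R."""
    design = np.column_stack([V, I, np.ones_like(V)])
    (a, b, c), *_ = np.linalg.lstsq(design, R, rcond=None)
    return a, b, c

def fit_orthogonal(V, I, R):
    """Plane minimising residuals orthogonal to it (smallest-variance direction)."""
    X = np.column_stack([V, I, R])
    mean = X.mean(axis=0)
    _, eigvec = np.linalg.eigh(np.cov(X, rowvar=False))  # eigenvalues ascending
    normal = eigvec[:, 0] / eigvec[2, 0]  # rescale so the normal is (-a, -b, 1)
    a, b = -normal[0], -normal[1]
    c = mean[2] - a * mean[0] - b * mean[1]
    return a, b, c

def residual_correlations(n=20000, seed=0):
    """Return {fit name: (slope a, correlation of residuals with V)}."""
    rng = np.random.default_rng(seed)
    V = rng.normal(2.2, 0.10, n)   # hypothetical log10(sigma)
    I = rng.normal(2.6, 0.25, n)   # hypothetical log10(I_o)
    R = 1.5 * V - 0.75 * I + rng.normal(0.0, 0.10, n)  # toy plane plus scatter
    out = {}
    for name, fit in (("direct", fit_direct), ("orthogonal", fit_orthogonal)):
        a, b, c = fit(V, I, R)
        # Orthogonal residuals differ from this only by the constant factor
        # 1/sqrt(1 + a^2 + b^2), which does not change the correlation.
        delta = R - a * V - b * I - c
        out[name] = (a, np.corrcoef(delta, V)[0, 1])
    return out

corr = residual_correlations()
for name, (a, r) in corr.items():
    print(f"{name:10s} a = {a:.2f}  corr(residual, log10 sigma) = {r:+.3f}")
```

In this toy the direct-fit residuals are uncorrelated with $\log_{10}\sigma$ by construction, while the orthogonal-fit residuals are anti-correlated with it and the orthogonal slope $a$ is steeper, which is the qualitative behaviour described above.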

To illustrate, the solid curves in Figure 4 (the same in each panel) show the maximum likelihood FP
for the full sample. The dashed curves show the FP, determined by using the $\chi^2$-method to minimize the
residuals orthogonal to the plane, in various subsamples defined by velocity dispersion. The panels for larger
velocity dispersions show steeper relations. Evidence for a steepening of the relation with increasing velocity
dispersion was seen by Jørgensen et al. (1996). Their sample was considerably smaller than ours, and so
they judged the trend they saw to be only marginal. Our much larger sample shows this trend clearly. We
have already argued that this steepening is an artifact of the fact that lines of constant velocity dispersion
are perpendicular to the x-axis. The maximum-likelihood fit to the subsamples is virtually the same as
that for the full sample, provided we include the correct velocity dispersion cuts in the normalization of the
likelihood. In other words, the maximum-likelihood fit is able to account for the bias introduced by making
a cut in velocity dispersion as well as apparent magnitude.



2.3. The mass-to-light ratio 

The Fundamental Plane is sometimes used to infer how the mass-to-light ratio depends on different
observed or physical parameters. For example, the scaling required by the virial theorem, $M_o \propto R_o\sigma^2$,
combined with the assumption that the mass-to-light ratio scales as $M_o/L \propto M_o^{\gamma}$, yields a Fundamental
Plane like relation of the form:

$$R_o \propto \sigma^{2(1-\gamma)/(1+\gamma)}\, I_o^{-1/(1+\gamma)}. \qquad (7)$$

The observed Fundamental Plane is $R_o \propto \sigma^a I_o^b$. If the relation above is to describe the observations, then
$\gamma$ must simultaneously satisfy two relations: $\gamma = (2-a)/(2+a)$, and $\gamma = -(1+b)/b$. The values of $b$ in
the literature are all about $-0.8$; setting $\gamma$ equal to the value required by $b$ and then writing $a$ in terms of $b$
gives $a = -2(1+2b)$. Most of the values of $a$ and $b$ in the shorter wavebands reported in the literature (see,
e.g., Table 4) are consistent with this scaling, whereas the higher values of $a$ found at longer wavelengths are
not. Although the direct fits to our sample have small values of $a$, the orthogonal fits give high values in all
four bands. These fits do not support the assumption that $M_o/L$ can be parametrized as a function of $M_o$
alone.
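
For readers who want the intermediate steps, the following is one way to reach equation (7) and the two conditions on $\gamma$. It is a sketch that assumes only the virial scaling, the power-law mass-to-light ratio, and the definition $L \propto I_o R_o^2$ used throughout these papers.

```latex
\begin{align}
M_o &\propto R_o\sigma^2, \qquad \frac{M_o}{L} \propto M_o^{\gamma}
  \;\Rightarrow\; L \propto M_o^{1-\gamma} \propto (R_o\sigma^2)^{1-\gamma}, \\
L &\propto I_o R_o^2
  \;\Rightarrow\; I_o R_o^{2} \propto R_o^{1-\gamma}\,\sigma^{2(1-\gamma)}
  \;\Rightarrow\; R_o \propto \sigma^{2(1-\gamma)/(1+\gamma)}\, I_o^{-1/(1+\gamma)}, \\
a &= \frac{2(1-\gamma)}{1+\gamma}, \quad b = -\frac{1}{1+\gamma}
  \;\Rightarrow\; \gamma = \frac{2-a}{2+a} = -\frac{1+b}{b}, \quad a = -2(1+2b).
\end{align}
```

For example, $b = -0.8$ gives $\gamma = 0.25$ and requires $a = 1.2$.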

Fig. 5. Ratio of effective mass $R_o\sigma^2$ to effective luminosity $(L/2)$ as a function of luminosity (top left),
mass (top right), velocity dispersion (middle left), surface brightness (middle right), the combination of
velocity dispersion and size suggested by the Fundamental Plane (bottom left), and color (bottom right).
Notice the substantial scatter around the best fit linear relation in the bottom left panel, the slope of which
is shallower than unity.

Another way to phrase this is to note that, when combined with the virial theorem requirement that
$(M_o/L) \propto \sigma^2/(R_o I_o)$, the Fundamental Plane relation $R_o \propto \sigma^a I_o^b$ yields

$$\left(\frac{M_o}{L}\right)_{\rm FP} \propto \sigma^{2+a/b}\, R_o^{-(1+b)/b} \qquad (8)$$

(e.g. Jørgensen et al. 1996; Kelson et al. 2000). The quantity on the right hand side is the mass-to-light
ratio 'predicted' by the Fundamental Plane, if $\sigma$ and $R_o$ are given, and the scatter in the Fundamental Plane
is ignored. This is a function of $M_o$ alone only if $a = -2(1+2b)$. Our orthogonal fit coefficients $a$ and $b$ are
not related in this way. Rather, for our Fundamental Plane, the dependence on $\sigma$ in equation (8) cancels
out almost exactly: to a very good approximation, we find $(M_o/L)_{\rm FP} \propto R_o^{-(1+b)/b} \propto R_o^{0.33}$. Alternatively, a
little algebra shows that the mass-to-light ratio is determined by the combination $(\sigma^2/I_o)^{0.25}$. Whether there is
a simple physical reason for this is an open question.
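
The near-cancellation of the velocity dispersion dependence is easy to check numerically. The sketch below assumes the r* coefficients quoted in the abstract ($a = 1.49$, $b = -0.75$); the function names are ours and are introduced only for illustration.

```python
def predicted_ml_exponents(a, b):
    """Exponents (p, q) in (M/L)_FP ∝ sigma**p * R_o**q when R_o ∝ sigma**a * I_o**b."""
    return 2.0 + a / b, -(1.0 + b) / b

def virial_a_for(b):
    """Value of a for which (M/L)_FP would depend on the mass R_o*sigma**2 alone."""
    return -2.0 * (1.0 + 2.0 * b)

p, q = predicted_ml_exponents(1.49, -0.75)
print(f"sigma exponent {p:+.3f}, R_o exponent {q:+.3f}")
print(f"a required by b = -0.75: {virial_a_for(-0.75):.2f}")
```

With these inputs the exponent of $\sigma$ is close to zero and the exponent of $R_o$ is one third, whereas a pure mass dependence would have required $a = 1$ rather than $a \approx 1.5$.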

In contrast to the predicted ratio, $(M_o/L)_{\rm FP}$, the combination $R_o\sigma^2/L$ is the 'observed' mass-to-light
ratio. The ratio of the observed value to the FP prediction of equation (8) is $(R_o/I_o^b\sigma^a)^{1/b}$. The scatter
in the logarithm of this ratio is $1/b$ times the scatter in the Fundamental Plane in the direction of $R_o$ (i.e., it
is the scatter in the quantity we called $\Delta_1$ in the previous subsections, divided by $b$). Inserting the values
from Table 2 shows that if the values of $\sigma$ and the effective radius in r* are used to predict the values of the
mass-to-light ratio in r*, then the uncertainty in the predicted ratio is 26%. This is larger than the values
quoted in the literature for early-type galaxies in clusters (e.g., Jørgensen et al. 1996; Kelson et al. 2000).

Unfortunately, this is somewhat confusing terminology, because the two mass-to-light ratios are not
proportional to each other. This can be seen by using the maximum-likelihood results of Table 1 to compute
the mean of the observed mass-to-light ratio $R_o\sigma^2/L$ at fixed predicted $(M/L)_{\rm FP}$, or simply by plotting
the two quantities against one another. Figure 5 shows how $R_o\sigma^2/L$ correlates with luminosity, mass $R_o\sigma^2$,
velocity dispersion, surface brightness, the ratio predicted by the Fundamental Plane, and color. The different
panels show obvious correlations; the maximum likelihood predictions for these correlations can be derived
from the coefficients in Table 1: $(R_o\sigma^2/L) \propto L^{0.14\pm0.02}$, $(R_o\sigma^2/L) \propto (R_o\sigma^2)^{0.22\pm0.05}$, $(R_o\sigma^2/L) \propto \sigma^{0.84\pm0.1}$,
and $(R_o\sigma^2/L) \propto$ R? o 27±0m. These are shown as dashed lines in the top four panels. A linear fit to the
scatter plot in the bottom left panel gives $(R_o\sigma^2/L) \propto (M/L)_{\rm FP}^{\ldots\pm0.05}$, with an rms scatter around the fit
of 0.14: the ratio predicted by the Fundamental Plane is not proportional to the observed ratio. A scatter
plot of $(M/L)_{\rm FP}$ against all these quantities is tighter, of course (recall the scatter around the FP has
been removed), although some of the slopes are significantly different. For example, $(M/L)_{\rm FP} \propto L^{0.16\pm0.04}$,
$(M/L)_{\rm FP} \propto (R_o\sigma^2)^{0.13\pm\ldots}$, and $(M/L)_{\rm FP} \propto \sigma^{0.21\pm0.03}$: the 'observed' and 'predicted' slopes of the
mass-to-light ratio versus $\sigma$ relations are very different. For this reason, one should be careful in interpreting
what is meant by the 'predicted' mass-to-light ratio. Our own view is that the observed ratio, $R_o\sigma^2/L$, is to
be preferred, as it is directly related to observables, and is independent of the fitting procedure used to fit
the Fundamental Plane.



2.4. The Fundamental Plane: Evidence for evolution? 

The Fundamental Plane is sometimes used to test for evolution. This is done by plotting $R_o$ versus
the combination of $\mu_o$ and $\sigma$ which defines the Fundamental Plane at low redshift, and then seeing if the
high-redshift population traces the same locus as the low redshift population. Figure 6 shows this test for
our g* band sample: solid lines (same in each panel) show the relation which fits the zero-redshift sample;
dashed lines show a line with the same slope which best-fits the higher redshift sample. The population at
higher redshift is displaced slightly to the left of the low redshift population; the text in the bottom of each
panel shows this shift, expressed as a change in the surface brightness $\mu_o$. The plot appears to show that, on
average, the higher redshift galaxies are brighter, with the brightening scaling approximately as $\Delta\mu_o \approx -2z$.

How much of this apparent brightening is really due to evolution, and how much is an artifact of the fact
that our sample is magnitude limited? To address this, we generated complete and magnitude limited mock
galaxy catalogs as described in Appendix A of Paper II, and then performed the same test for evolution.
Comparing the shifts in the two simulations allows us to estimate how much of the shift is due to the selection
effect. Figure 7 shows the results in our simulated g* (left) and r* (right) catalogs. The solid lines in each
panel show the zero-redshift relation, and the dotted and dashed lines show lines of the same slope which
best-fit the points at low and high redshift, respectively. The text in the bottom shows how much of the
shift in $\mu_o$ is due to the magnitude limit, and how much to evolution. The sum of the two contributions is
the total shift seen in the magnitude limited simulations. Notice that this sum is similar to that seen in the
data (Figure 6), both at low and high redshifts, suggesting that our simulations describe the varying roles
played by evolution and selection effects accurately. Since the parameters of the simulations were set by
the maximum likelihood analysis, we conclude that the likelihood analysis of the evolution in luminosities
is reasonably accurate ($\Delta\mu_o \approx -1.15z$) in g*, but we note that this evolution is less than one would have
inferred if selection effects were ignored ($\Delta\mu_o \approx -2z$).
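
The way a magnitude limit can mimic brightening is illustrated by the toy mock below. It is a sketch under loudly stated assumptions and is not the procedure of Appendix A of Paper II: the Hubble constant, the low-redshift distance approximation, the magnitude limit of 17.5, and the Gaussian widths are all hypothetical choices, and the population is non-evolving by construction. At fixed $R_o$ and $\sigma$ a residual in $\mu_o$ is also a residual in absolute magnitude, so the apparent-magnitude cut preferentially keeps galaxies with brighter-than-predicted surface brightness at high redshift.

```python
import numpy as np

C_KM_S = 299792.458  # speed of light in km/s
H0 = 70.0            # km/s/Mpc; assumed for illustration only

def distance_modulus(z):
    """Low-redshift approximation d_L ~ (c z / H0)(1 + z/2); adequate for a toy."""
    d_mpc = C_KM_S * z / H0 * (1.0 + 0.5 * z)
    return 5.0 * np.log10(d_mpc * 1e6 / 10.0)

def mock_shift(n=400_000, m_lim=17.5, seed=1):
    """Mean FP residual in mu_o per redshift bin: (complete, magnitude limited)."""
    rng = np.random.default_rng(seed)
    # Redshifts with p(z) ∝ z^2 on [0.01, 0.3], i.e. roughly uniform in volume.
    z = np.cbrt(rng.uniform(0.01**3, 0.3**3, n))
    m_fp = rng.normal(-21.0, 0.7, n)  # magnitude implied by (R_o, sigma) via the FP
    d_mu = rng.normal(0.0, 0.3, n)    # residual from the FP in mu_o; no evolution
    keep = m_fp + d_mu + distance_modulus(z) < m_lim
    shifts = {}
    for label, sel in (("low", z < 0.1), ("high", z > 0.2)):
        shifts[label] = (d_mu[sel].mean(), d_mu[sel & keep].mean())
    return shifts

shifts = mock_shift()
for label, (complete, limited) in shifts.items():
    print(f"{label:4s} z: complete {complete:+.3f} mag, magnitude limited {limited:+.3f} mag")
```

In the complete mock the mean residual is consistent with zero in both bins; after the cut the high-redshift bin is shifted to brighter surface brightness even though nothing evolves.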

The importance of selection effects in our sample has implications for another way in which studies of
evolution are presented. If galaxies do not evolve, then the FP can be used to define a standard candle,
so the test checks if residuals from the FP in the direction of the surface-brightness variable, when plotted
versus redshift, follow Tolman's $(1+z)^4$ cosmological dimming law. If Friedmann-Robertson-Walker models
are correct, then departures from this $(1+z)^4$ dimming trend can be used to test for evolution. This can
be done if one assumes that the main effect of evolution is to change the luminosities of galaxies. If so, then
evolution will show up as a tendency for the residuals from the FP, in the $\mu_o$ direction, to drift away from
the $(1+z)^4$ dimming (e.g., Sandage & Perelmuter 1990; Pahre, Djorgovski & de Carvalho 1996).

Figure 8 shows this trend in our dataset. The lowest dashed lines in all panels show the expected
$(1+z)^4$ dimming; panels on the left/right show results in g*/r*. Consider the top two panels first. The
points show residuals with respect to the zero-redshift Fundamental Plane in our sample. The crosses show
the median residual in a small redshift bin. The galaxies do not quite follow the expected $(1+z)^4$ dimming.
The similarity to the $(1+z)^4$ dimming argues in favour of standard cosmological models, whereas the small
difference from the expected trend is sometimes interpreted as evidence for evolution (e.g., Jørgensen et al.
1999; van Dokkum et al. 1998, 2001; Treu et al. 1999, 2001a,b).

Of course, to correctly quantify this evolution, we must account for selection effects. The dashed lines
which lie between the $(1+z)^4$ scaling and the data (i.e., the crosses) show how the surface brightness should
scale if there were passive evolution of the form suggested by the maximum likelihood analysis, but there
were no magnitude limit. That is, if $M_*(z) = M_*(0) - Qz$, then the surface brightnesses should scale as
$(1+z)^{4-0.92Q}$. The solid curves show the result of making the measurement in simulated magnitude limited
catalogs which include this passive evolution. Notice how different these solid curves are from the dashed
curves (they imply $Q$ about twice the correct value), but note how similar they are to the data. This shows
that about half of the evolution one would naively have inferred from such a plot is a consequence of the
magnitude limit.
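
The exponent $4 - 0.92Q$ follows from writing the brightening $10^{0.4Qz}$ as $e^{0.4\ln(10)\,Qz}$ and using $\ln(1+z) \approx z$ at low redshift. The short sketch below checks the coefficient and compares the exact and approximate shifts; it assumes $Q = 1.15$, the g* value quoted earlier in this subsection, and the function names are ours.

```python
import math

def effective_dimming_exponent(Q):
    """n such that surface brightness dims roughly as (1+z)**n if M*(z) = M*(0) - Q z."""
    return 4.0 - 0.4 * math.log(10.0) * Q

def delta_mu(z, Q=0.0):
    """Exact shift in surface brightness (mag): Tolman dimming minus brightening Q z."""
    return 10.0 * math.log10(1.0 + z) - Q * z

z, Q = 0.2, 1.15
exact = delta_mu(z, Q)
approx = 2.5 * effective_dimming_exponent(Q) * math.log10(1.0 + z)
print(f"0.4 ln 10 = {0.4 * math.log(10.0):.3f}")
print(f"z = {z}: exact {exact:.3f} mag, power-law approximation {approx:.3f} mag")
```

The two expressions agree to a few hundredths of a magnitude over the redshift range of the sample, which is why the power-law form is a convenient way to draw the expected trend.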

To further emphasize the strength of this effect, we constructed simulations in which there was no 
evolution whatsoever. We did this by first making maximum likelihood estimates of the joint luminosity, 

Fig. 6. The g* FP in four redshift bins. The slope of the FP is fixed to that at zero redshift; only the
zero-point is allowed to vary. The zero-point shifts systematically with redshift. The same plot for r* shows
similar but smaller shifts.



[Figure 7 image: two panels (g* left, r* right) plotted against $\log_{10}\sigma + 0.20(\mu_o - \mathrm{const})$; in-panel text lists the 'Evol' and 'SelEf' shifts in $\mu_o$ at $z \approx 0.06$ and $z \approx 0.20$.]



Fig. 7. The FP in the g* (left panel) and r* (right panel) magnitude-limited mock catalogs. Solid line
shows the FP at z = 0. Dotted and dashed lines show fits using a low and high redshift subsample only.
For these fits, the slope of the FP is required to be the same as the solid line; only the zero-point is allowed
to vary. The shift seen in the complete simulations is labeled 'Evol-$\Delta\mu_o$', whereas the shift seen in the
magnitude limited simulations is the sum of this and the quantity labeled 'SelEf-$\Delta\mu_o$'. This sum is similar
to the shift seen in the SDSS data, suggesting that selection effects are not negligible.

Fig. 8. Residuals of the zero-redshift FP with respect to the surface brightness, before correcting for
cosmological dimming, versus redshift in the four bands. Lowest dashed line in all panels shows the $(1+z)^4$
dimming expected if there is no evolution. Solid curves in top panels show the same measurement in mock
simulations of a magnitude limited sample of a passively evolving population. Dashed lines in between show
the actual evolution in surface brightness; the difference between these and the solid curves is an artifact of
the magnitude limit. Bottom panel shows the same test applied using the parameters of the Fundamental
Plane which best describes the data if there is required to be no evolution whatsoever. Solid lines show what
one would observe in a magnitude limited sample of such a population. In this case, the entire trend away
from the $(1+z)^4$ dimming is a selection effect. Note how, once the magnitude limit has been applied, both
the evolving (top) and non-evolving populations (bottom) appear very similar to our observed sample.

size and velocity dispersion distribution in which no evolution was allowed. (For the reasons discussed earlier,
the associated no-evolution Fundamental Plane coefficient $a$ is steeper by about 10%.) This was then used
to generate mock catalogs in which there is no evolution. The crosses in the bottom panels show the result
of repeating the same procedure as in the top panels, but now using the parameters of the no-evolution
Fundamental Plane, and the solid line shows the measurement in the no-evolution simulations in which, by
construction, the population of galaxies at all redshifts is identical. Therefore, the shifts from the $(1+z)^4$
dimming we see in the magnitude limited no-evolution catalogs (solid curves in bottom panels) are entirely
due to the magnitude limit. Notice how similar the solid lines from our no-evolution simulations are to the
actual data. If we believed there really were no evolution, then the results shown in the bottom panel would
lead us to conclude that much of the trend away from the $(1+z)^4$ dimming is a selection effect; it is not
evidence for evolution.

(The fact that we were able to find a non-evolving population which mimics the observations so well 
suggests that the population of early-type galaxies at the median redshift of our sample must be rather 
similar to the population at lower and at higher redshifts. This, in turn, can constrain models of when the 
stars in these galaxies must have formed.) 

We view our no-evolution simulations as a warning about the accuracy of this particular test of evolution.
If the evolution is weak, then it appears that the results of this test depend critically on how the catalog
was selected, and on what one uses as the fiducial Fundamental Plane. To make this second point, we
followed the procedure adopted by many other recent publications. Namely, we assumed that the
zero-redshift Fundamental Plane has the shape reported by Jørgensen et al. (1996) for Coma, for which $a$ is
about 15% smaller than what we find in g*. If no account is taken of selection effects, then the inferred
evolution in $\mu_o$ results in a value of $Q$ which is about four times larger than the one we report in
Table 1!

Our results indicate that inferences about evolution which are based on this test depend uncomfortably
strongly on the strength of selection effects, and on what one assumes for the fiducial shape of the
Fundamental Plane. In this respect, our findings about the role of, and the need to account for, selection effects
are consistent with those reported by Simard et al. (1999). While we believe we have strong evidence that
the early-type population is evolving, we do not believe that the strongest evidence of this evolution comes
from either of the tests presented in this subsection. Nevertheless, it is reassuring that the evolution we see
from these Fundamental Plane tests is consistent with that which we estimated using the likelihood analysis
in Paper II, and is also consistent with what we use to make our K-corrections. Namely, a passively evolving
population which formed the bulk of its stars about 9 Gyrs ago appears to provide a reasonable description
of the evolution of the surface brightnesses in our sample.

2.5. The Fundamental Plane: Dependence on environment 

This section is devoted to a study of if and how the properties of early-type galaxies depend on environ- 
ment. Paper I describes our working definition of environment — essentially, we use the number of galaxies 
which are nearby in color-, angular- and redshift-space as an indication of the local density. Our procedure 
for assigning neighbours is least secure in the lowest redshift bin (typically z < 0.08). 

Paper I shows that when the number of near neighbours is small, the luminosities, sizes and velocity
dispersions all increase slightly as the local density increases, whereas the surface brightnesses decrease
slightly, although all these trends are very weak. A more efficient way of seeing if the properties of galaxies
depend on environment is to show the residuals from the Fundamental Plane. As we argue below, this
efficiency comes at a cost: if the residuals correlate with environment, it is difficult to decide if the correlation
is due to changes in luminosity, size or velocity dispersion.

Fig. 9. Residuals from the FP as a function of number of nearby neighbours. Stars, circles, diamonds,
triangles, squares and crosses show averages over galaxies in the redshift ranges z < 0.075, 0.075 < z < 0.1,
0.1 < z < 0.12, 0.12 < z < 0.14, 0.14 < z < 0.18 and z > 0.18.

Figure 9 shows the differences between galaxy surface brightnesses and those predicted by the
zero-redshift maximum likelihood FP given their sizes and velocity dispersions, as a function of local density.
Stars, circles, diamonds, triangles, squares and crosses show averages over galaxies in the redshift ranges
z < 0.075, 0.075 < z < 0.1, 0.1 < z < 0.12, 0.12 < z < 0.14, 0.14 < z < 0.18 and z > 0.18. Error bars
show the error in determining the mean. (For clarity, the symbols have been offset slightly from each other.)
The plot shows that the residuals depend on redshift; we have already argued that this is a combination of
evolution and selection effects. Notice that, in all redshift bins, the residuals tend to increase as local density
increases. This suggests that the mean residual from the Fundamental Plane depends on environment. If
the offset in surface brightness is interpreted as evidence that galaxies in denser regions are slightly less
luminous than their counterparts in less dense regions, then this might be evidence that they formed at higher
redshift. While this is a reasonable conclusion, we should be cautious: because $\mu_o - \mu_{\rm FP}(R_o,\sigma) = -\Delta_1/b$,
what we have really found is that the residuals in the direction of $R_o$ correlate with environment. Because
$\sigma - \sigma_{\rm FP}(R_o,\mu_o) = -\Delta_1/a$, we might also have concluded that the velocity dispersions of galaxies in dense
regions are systematically different from those of galaxies which have the same sizes and luminosities but are
in the field. For similar reasons, a plot of the mean residual orthogonal to the plane shows a dependence on
environment. (However, the rms scatter in the orthogonal direction of the residuals around the mean residual
in each density bin, when plotted as a function of density, shows no trend.) Thus, while the Fundamental
Plane suggests that the properties of galaxies depend on environment, it does not say how.



2.6. The K-space projection 

Bender et al. (1992) suggested three simple combinations of the three observables:

$$\kappa_1 = \frac{\log_{10}(R_o\sigma^2)}{\sqrt{2}}, \qquad \kappa_2 = \frac{\log_{10}(I_o^2\sigma^2/R_o)}{\sqrt{6}}, \qquad \kappa_3 = \frac{\log_{10}(I_o^{-1}\sigma^2/R_o)}{\sqrt{3}},$$

which, they argued, correspond approximately to the FP viewed face-on ($\kappa_2$-$\kappa_1$), and the two edge-on
projections ($\kappa_3$-$\kappa_1$ and $\kappa_3$-$\kappa_2$). They also argued that their parametrization was simply related to the
underlying physical variables. For example, $\kappa_1 \propto$ mass and $\kappa_3 \propto$ the mass-to-light ratio. The $\kappa_1$-$\kappa_2$
projection would view the FP face on if $R_o \propto \sigma^2/I_o$; recall that we found $R_o \propto (\sigma^2/I_o)^{0.75}$.
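
As an illustration, the $\kappa$-space coordinates can be written as a rotation of the three logarithmic observables. The sketch below assumes the standard Bender et al. (1992) definitions, $\kappa_1 = \log_{10}(R_o\sigma^2)/\sqrt{2}$, $\kappa_2 = \log_{10}(I_o^2\sigma^2/R_o)/\sqrt{6}$ and $\kappa_3 = \log_{10}(I_o^{-1}\sigma^2/R_o)/\sqrt{3}$; the names `KAPPA` and `kappa_space` are ours, and the input values in the example are arbitrary.

```python
import numpy as np

# Rows give kappa_1, kappa_2, kappa_3 as combinations of
# (log10 sigma^2, log10 I_o, log10 R_o).
KAPPA = np.array([
    [1.0,  0.0,  1.0],
    [1.0,  2.0, -1.0],
    [1.0, -1.0, -1.0],
]) / np.sqrt([[2.0], [6.0], [3.0]])

def kappa_space(log_sigma, log_I, log_R):
    """Map (log10 sigma, log10 I_o, log10 R_o) to (kappa_1, kappa_2, kappa_3)."""
    obs = np.stack([2.0 * np.asarray(log_sigma), np.asarray(log_I), np.asarray(log_R)])
    return KAPPA @ obs

k1, k2, k3 = kappa_space(2.0, 3.0, 0.5)  # arbitrary example values
print(k1, k2, k3)
print(np.allclose(KAPPA @ KAPPA.T, np.eye(3)))  # the transformation is orthonormal
```

Because the three rows are orthonormal, the transformation only rotates the $(\log_{10}\sigma^2, \log_{10}I_o, \log_{10}R_o)$ space; it does not stretch it, so scatter measured in $\kappa$-space is directly comparable to scatter in the original variables.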

Bender et al.'s choice of parameters was criticized by Pahre et al. (1998) on the grounds that if the
effective radius $R_o$ is a function of wavelength, then the 'mass' becomes a function of wavelength, which
is unphysical. On average, the effective radii of the galaxies in our sample do increase with decreasing
wavelength (Paper I), so one might conclude that Pahre et al.'s objections are valid. However, recall that
we do not use the measured velocity dispersion directly; rather, $\sigma$ represents the value the dispersion would
have had at some fixed fraction of $R_o$. If $R_o$ depends on wavelength, and we wish to measure the velocity
dispersion at a fixed fraction of $R_o$, then one might argue that we should also correct the measured velocity
dispersion differently in the different bands. The velocity dispersion decreases with increasing radius. So,
if $R_o$ is larger in the blue band than the red, then the associated velocity dispersion we should use in
the blue band should be smaller than in the red. One might imagine that the combination $R_o\sigma^2$ remains
approximately constant after all. Whether or not it does, we have chosen to present the SDSS early-type
sample in the $\kappa$-space projection introduced by Bender et al.

Fig. 10. The early-type sample in the four SDSS bands viewed in the $\kappa$-space projection of Bender et al.
(1992). The dashed line in the upper right corner of each panel shows $\kappa_1 + \kappa_2 = 8$, what Burstein et al.
(1997) termed the 'zone of avoidance'.

Figure 10 shows the results for the four SDSS bands. Because the mean surface brightness depends on
waveband, we set $\log_{10} I_o = 0.4(27 - \mu_o + \langle\mu_o\rangle - \langle\mu_g\rangle)$ when making the plots, so as to facilitate comparison
with Bender et al. (1992) and Burstein et al. (1997). The dashed line in the upper right corner of each
panel shows $\kappa_1 + \kappa_2 = 8$, what Burstein et al. termed the 'zone of avoidance'. Had we not accounted for the
fact that the mean surface brightness is different in the different bands, then the galaxies would populate
this zone.
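
The band-dependent rescaling of the surface brightness amounts to a one-line function. The sketch below assumes the relation $\log_{10} I_o = 0.4(27 - \mu_o + \langle\mu_o\rangle - \langle\mu_g\rangle)$ stated in the text; the function name and the numerical values in the example are hypothetical and are not measurements from the sample.

```python
def log10_intensity(mu, mean_mu_band, mean_mu_g, zero_point=27.0):
    """log10 I_o after shifting a band's mean surface brightness to the g* mean."""
    return 0.4 * (zero_point - mu + mean_mu_band - mean_mu_g)

# A galaxy lying exactly at its band's mean surface brightness maps to the same
# log10 I_o in every band, namely 0.4 * (27 - <mu_g>).
print(log10_intensity(mu=19.5, mean_mu_band=19.5, mean_mu_g=20.5))
```

With this convention, differences between bands in the $\kappa$-space plots reflect the spread of each band about its own mean rather than the offset between the mean surface brightnesses.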

The magnitude limit is clearly visible in the lower right corner of the $\kappa_3$-$\kappa_1$ projection; we have not
made any correction for it. When the sample is split by color, the redder galaxies appear to follow a tighter
relation than the bluer galaxies, and they also tend to lie slightly closer to the zone of avoidance. We leave
quantifying and interpreting these trends to future work.

3. Discussion and conclusions 

We have studied the Fundamental Plane populated by ~9000 early-type galaxies over the redshift
range 0.01 < z < 0.3 in the g*, r*, i* and z* bands. If this Fundamental Plane is defined by minimizing the
residuals orthogonal to it, then $R_o \propto \sigma^{1.5}/I_o^{0.77}$ (see Table 2 for the exact coefficients). The Fundamental
Plane is remarkably similar in the different bands (Figure 1), and appears to be slightly warped in the shorter
wavebands (Figure 2). Residuals with respect to the direct fit (i.e., the FP defined by minimizing the residuals
in the direction of $\log_{10} R_o$) do not correlate with either velocity dispersion or color, whereas residuals from
the orthogonal fit correlate with both (Figure 3). This correlation with $\sigma$ is simply a projection effect (see
Figure 4 and related discussion), whereas the correlation with color is mainly due to the fact that color and
$\sigma$ are strongly correlated (Paper IV). The Fundamental Plane is intrinsically slightly thinner in the redder
wavebands. This thickness is sometimes expressed in terms of the accuracy to which the FP can provide
redshift-independent distance estimates; this is about 20%. If the thickness is expressed as a scatter in the
mass-to-light ratio at fixed size and velocity dispersion, then this scatter is about 30%.

The simplest virial theorem prediction for the shape of the Fundamental Plane is that $R_o \propto \sigma^2/I_o$. This
assumes that the observed velocity dispersion $\sigma^2$ is proportional to the kinetic energy $\sigma^2_{\rm vir}$ which enters the
virial theorem. Busarello et al. (1997) argue that in their data $\log_{10}\sigma = (1.28 \pm 0.11)\log_{10}\sigma_{\rm vir} - 0.58$, so
that $\sigma^{1.5} \propto \sigma_{\rm vir}^{1.92}$. Since the coefficient of $\sigma$ in the Fundamental Plane we find in all four bands is about 1.5, it
would be interesting to see if the kinetic energy for the galaxies in our sample scales as it did in Busarello et
al.'s sample. To do this, measurements of the velocity dispersion profiles of (a subsample of) the galaxies in
our sample are required.

Tests for passive luminosity evolution which use the Fundamental Plane are severely affected by selection
effects and the choice of the fiducial Fundamental Plane against which to measure the evolution (Figures 6-8).
These tests suggest that the surface brightnesses of galaxies at higher redshifts in our sample are brighter
than those of similar galaxies nearby. The amount of brightening is consistent with the luminosity evolution
estimated in Paper II.

The way in which galaxies scatter from the Fundamental Plane correlates weakly with their local
environment (Figure 9). If this is caused entirely by differences in surface brightness, then galaxies in overdense
regions are slightly fainter. If so, then single-age stellar population models suggest that early-type galaxies
in denser regions formed at higher redshift. However, it may be that the velocity dispersions are higher in
denser regions (Paper II). A larger sample is necessary to make a more definitive statement.

By the time the Sloan Digital Sky Survey is complete, the uncertainty in the K-corrections, which
prevents us at the present time from making more precise quantitative statements about the evolution of the
luminosities and colors, will be better understood. In addition, the size of the sample will have increased by
more than an order of magnitude. This will allow us to provide a more quantitative study of the effects of
environment than we are able to at the present time.